TUSQ: Targeted High-Utility Sequence Querying
Authors
Abstract
Significant efforts have been expended in the research and development of a database management system (DBMS) that has a wide range of applications for managing an enormous collection of multisource, heterogeneous, complex, or growing data. Besides the primary functions (i.e., create, delete, and update), a practical and impeccable DBMS can interact with users through information selection, that is, querying with their targets. Previous querying algorithms, such as frequent itemset querying and sequential pattern querying (SPQ), have focused on the measurement of frequency, which does not involve the concept of utility, a measure that is helpful for users to discover more informative patterns. To apply the querying technology to wider applications, we incorporate utility into target-oriented SPQ and formulate the task of targeted utility-oriented sequence querying. To address the proposed problem, we develop a novel algorithm, namely targeted high-utility sequence querying (TUSQ), based on two novel upper bounds (suffix remain utility and terminated descendants utility) as well as a vertical last instance table structure. For further efficiency, TUSQ relies on a projection technology utilizing a compact data structure called the targeted chain. An extensive experimental study conducted on several real and synthetic datasets shows that the proposed algorithm outperformed the designed baseline algorithm in terms of runtime, memory consumption, and candidate filtering.
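To make the task definition concrete, the following is a minimal brute-force sketch of what it means for a candidate sequence to be both high-utility and target-containing. It is not the TUSQ algorithm: it uses none of the upper bounds, the last instance table, or the targeted chain named in the abstract. It also assumes a simplified database in which each position of a sequence holds a single (item, quantity) event rather than an itemset. All names here (`contains`, `utility_in_sequence`, `utility_in_database`, `is_targeted_high_utility`, `unit_profit`, `db`) and all data values are hypothetical and chosen for illustration only.

```python
from typing import Dict, List, Optional, Tuple

# Assumption: one (item, quantity) event per position, not a full itemset.
QSequence = List[Tuple[str, int]]


def contains(pattern: List[str], target: List[str]) -> bool:
    """Return True if `target` is a subsequence of `pattern`."""
    it = iter(pattern)
    return all(x in it for x in target)


def utility_in_sequence(pattern: List[str], qseq: QSequence,
                        unit_profit: Dict[str, int]) -> Optional[int]:
    """Maximum utility over all occurrences of `pattern` in `qseq`.

    Returns None when the pattern does not occur at all.
    """
    neg = float("-inf")
    # best[k] = best utility of matching the first k pattern items so far.
    best = [0] + [neg] * len(pattern)
    for item, qty in qseq:
        u = qty * unit_profit[item]
        # Go right to left so one event is used at most once per occurrence.
        for k in range(len(pattern), 0, -1):
            if pattern[k - 1] == item and best[k - 1] != neg:
                best[k] = max(best[k], best[k - 1] + u)
    return None if best[-1] == neg else int(best[-1])


def utility_in_database(pattern: List[str], db: List[QSequence],
                        unit_profit: Dict[str, int]) -> int:
    """Sum the per-sequence utilities over sequences that contain the pattern."""
    total = 0
    for qseq in db:
        u = utility_in_sequence(pattern, qseq, unit_profit)
        if u is not None:
            total += u
    return total


def is_targeted_high_utility(pattern: List[str], target: List[str],
                             db: List[QSequence], unit_profit: Dict[str, int],
                             min_util: int) -> bool:
    """A pattern qualifies if it contains the target and meets the threshold."""
    return (contains(pattern, target)
            and utility_in_database(pattern, db, unit_profit) >= min_util)


# Hypothetical toy data.
unit_profit = {"a": 2, "b": 1, "c": 3}
db = [
    [("a", 1), ("b", 2), ("a", 3), ("c", 1)],
    [("b", 1), ("a", 2), ("c", 2)],
]

print(utility_in_database(["a", "c"], db, unit_profit))
print(is_targeted_high_utility(["a", "c"], ["c"], db, unit_profit, 15))
```

Checking every candidate this way is exponential in practice, which is the cost that pruning with upper bounds and projection is intended to avoid.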
Similar resources
Declarative Querying for Biological Sequence Databases
The ongoing revolution in life sciences research is producing vast amounts of genetic and proteomic sequence data. Scientists want to pose increasingly complex queries on this data, but current methods for querying biological sequences are primitive and largely procedural. This limits the ease with which complex queries can be posed, and often results in very inefficient query plans. There is a...
ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data
We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTML pages (the so-called structured Web). It illustrates a two-phase querying of the Web, in which an intentional description of the targeted data is first provided, in a flexible and widely applicable manner. ObjectRunn...
A New Algorithm for High Average-utility Itemset Mining
High utility itemset mining (HUIM) is an emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds a minimum threshold. The basic HUIM problem does not consider the length of itemsets in its utility measurement, and utility values tend to become higher for itemsets containing more items...
A High Performance XML Querying Architecture
Data exchange on the Internet plays an essential role in electronic business (e-business). A recent trend in e-business is to create distributed databases to facilitate data exchange. In most cases, the distributed databases are developed by integrating existing systems, which may be in different database models, and on different hardware and/or software platforms. Heterogeneity may cause many ...
Semantic High Level Querying in Sensor Networks
The quick development and deployment of sensor technology within the general frame of the Internet of Things poses relevant opportunities and challenges. The sensor is not a pure data source, but an entity (Semantic Sensor Web) with associated metadata, and it is a building block of a “worldwide distributed” real-time database, to be processed through real-time queries. Important challenges are to...
Journal
Journal title: IEEE Transactions on Big Data
Year: 2023
ISSN: 2372-2096, 2332-7790
DOI: https://doi.org/10.1109/tbdata.2022.3175428